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Abstract 

The importance of renewable power production is a set goal in terms of 
the energy turnaround. Developing short-term wind speed forecasting 
improvements might increase the profitability of wind power. This ar¬ 
ticle compares two novel approaches to model and predict wind speed. 
Both approaches incorporate periodic interactions, whereas the first 
model uses Fourier series to model the periodicity. The second model 
takes ? 2 ,p generalised trigonometric functions into consideration. The 
aforementioned Fourier series are special types of the p-generalised 
trigonometrical function and therefore model 1 is nested in model 2. 
The two models use an ARFIMA-APARCH process to cover the au¬ 
tocorrelation and the heteroscedasticity. A data set which consist of 
10 minute data collected at four stations at the German-Polish border 
from August 2007 to December 2012 is analysed. The most impor¬ 
tant finding is an enhancement of the forecasting accuracy up to three 
hours that is directly related to our new short-term forecasting model. 
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1 Introduction 


Fossil fuels like oil, gas and coal are rare resources and it is well known that 
they are finite. The catastrophe in Fukushima (Japan) put a negative light 
onto nuclear energy and forced people to reconsider nuclear power plants. 
One of the most emergent renewable energy sources is the wind power. [22] 
report that the worldwide wind capacity grew by 5.5% from the end of 2013 to 
June 2014. Moreover, they state that this capacity generates approximately 
4% of the worldwide electricity supply. The electricity sector of the European 
Union (EU) tends towards wind energy, as [21] remark. Besides, they observe 
that within the last 15 years, 28% of the newly installed power production 
capacity stem from wind power. The leading wind power country in the 
European Union is Germany, which has an installed capacity of 36.488 MW 
by June 2014 [22], [21] End that fifteen EU countries have an installed 

capacity of wind power of each > 1.000 MW. Poland and Romania are more 
recent EU members which also exceed the threshold of 1.000 MW [21]. The 
installed capacity in Poland amounts to 3.727 MW [22], The geographical 
region at the German-Polish border is very much alike, but the installed 
capacity varies significantly, as [22] report. 

The profit of wind power strongly depends on several properties, as [4] point 
out, but the focus of this article is the wind speed prediction. The wind 
speed is an essential part of the wind power. Moreover, accurate wind fore¬ 
casts can be used to optimise the overall electricity supply, particularly the 
feed-in of wind power. Developing new wind forecasting models depends 
on the previously specified purpose and forecasting horizon. [19], [23] as 
well as [12] provide a broad review of differing wind speed and wind power 
prediction models. As [12] point out, conventional time series models are 
easy to formulate and often used for short-term predictions. The wind speed 
possesses different properties like periodicity, persistence, heteroscedastic- 
ity as well as further characteristics. [9] show a wind speed forecasting 
approach which contains an autoregressive fractionally integrated moving 
average (ARFIMA) process. It is one of the first models that comprise a 
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long memory process. Since their contribution, the ARFIMA model has be¬ 
come very common to model the wind. Hence, [20] examine extensions of 
the approach. They compare an ARFIMA model with generalised autore¬ 
gressive conditional heteroscedasticity (GARCH) and periodic regressors to 
calibrated and smoothed ensemble-based wind forecasts for hourly data. The 
periodic behaviour of wind speed depends on the temporal resolution of the 
underlying data set. Frequently, such periodicities are modelled by Fourier 
series [20, 2], Altogether, miscellaneous articles report that wind speed is au- 
tocorrelated and shows periodic behaviour. The wind speed characteristics 
are commonly combined in several ways. [11] implement a model which is 
based on Bayesian theory and structural breaks to implicate these properties. 
[1] cover a comparison of different long range dependence models with time 
varying regressors and different estimation methods. A further characteristic 
is given by the heteroscedasticity of wind. [2] use a combination of ARMA 
models with periodic regressors. Furthermore, they introduce Fourier se¬ 
ries for the variance part to model the heteroscedasticity. [1] find that their 
data set contains asymmetric conditional heteroscedasticity. Certainly, this 
is only a brief overview of prevalent wind characteristics. [10] incorporate 
spatial information within their forecasting models to extend the accuracy. 
They perform a comparison of a regime switching model and a bivariate 
skew-t model. [24] develop a generalisation of the regime switching approach 
which is introduced by [8]. The generalised regime switching space-time 
model includes several wind characteristics. Our model covers periodicity, 
persistence and heteroscedasticity and thus includes information to provide 
suitable short-term forecasts. 

As [23] point out, those forecasting horizons are beneficial for dispatch plan¬ 
ning and for economic load decisions. This article contains two wind speed 
prediction approaches. Both models use an autoregressive fractionally in¬ 
tegrated moving average (ARFIMA) process with periodic regressors and 
conditional heteroscedasticity among the variance. Particularly, the two ap¬ 
proaches incorporate an asymmetric power generalised autoregressive condi¬ 
tional heteroscedasticity (APARCH) process within the conditional variance 
[5]. Moreover, both models take annual and diurnal interactions into account. 
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The difference of the two approaches is linked to the periodic mean part in 
addition to the autocorrelation. The first model uses the classical sine and 
cosine Fourier series. The second model contains / 2 . p generalised trigono¬ 
metric functions [?, see]]richter2007generalized to model seasonal changes. 
The analysed data set consists of 10 minute wind speed observations. This 
data is used to fit the model and perform predictions up to three hours at 
four different stations at the German-Polish border. The forecasting results 
are evaluated by several accuracy measures. Furthermore, a loss function as 
introduced by [10] is taken into consideration. The importance of this loss 
function is underlined by [23]. 

In Section 2, the general model for our wind speed is introduced. Following 
the theoretical content, we discuss the estimation of the model. Section 3 
depicts several methods to evaluate the forecasting accuracy of the selected 
models. Finally, Section 4 draws a conclusion of the results discussed in this 
paper. 


2 Description of two wind speed models and 
their in-sample properties 

The contemplated wind speed data set is recorded at four stations in the state 
of Brandenburg. We consider high-frequency data (10 minute) which is pro¬ 
vided by the “Deutscher Wetterdienst” (DWD). Brandenburg is Germany’s 
fifth biggest territorial area. The stations are situated near the Polish border 
in Manschnow (52°33 'N 14°32 ’E), Lindenberg (52°13W 14°07 'E), Anger- 
miinde (53°02'A r 14 o 00'A) and Griinow (53°19W 13°56 / £'). The wind speed 
data consists of approximately 280,000 observations within a time frame 
from August 2007 to December 2012, with 0.1% missing values. The gaps of 
missing data are not longer than a few hours in all cases. We take several 
interpolations into consideration, but they provide nearly identical results. 
Therefore, we decide to use linear interpolation, as it is the simplest and the 
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least computationally intensive procedure. The in-sample model calibration 
starts in August 1, 2007 and reaches to December 31, 2011. 

The considered high-frequency wind data exhibit several of the aforemen¬ 
tioned wind speed characteristics. Figure 1 shows a sample of the wind 
speed observations and the corresponding histogram of station Lindenberg. 
We observe that the wind is rapidly changing and involves a high correlation 
structure, as shown in Figure 2. This figure shows the autocorcllation func¬ 
tion up to lag 5, 000 which corresponds to more than one month. The other 
measurement stations provide a similar characteristics. 

[Figure 1 and 2 about here.] 

The periodic characteristic of wind depends on the frequency of the observed 
data. In particular, we analyse the spectral density to find all necessary 
periods. The basic tool for these inferences is the periodogram, which is 
supposed to be the sample analogue of the spectrum. A detailed descrip¬ 
tion of this is given by [3]. Figure 3 shows the smoothed periodogram by 
using the Fast Fourier Transformation (FFT). The peaks included in this 
picture clearly depict a periodic behaviour for different frequencies In 
general, we observe Si,...,sjv £ R periodicities with si > ... > s n and 
Sn = 1 /con- Our choice of the peak depends on the period s G (si,..., Sn} 
on the one hand and on the total value of the estimated spectrum on the 
other hand. Near the point of origin we detect periods of one year and more. 
Bearing in mind that we analyse a data set of about five and a half years, 
we neglect peaks with periodic cycles of more than one year. Altogether, we 
consider at least four periodicities. The selected peaks correspond to diurnal 
si = 1/.000019 = 52560 and annual s 2 = 1/.00694 = 144 periodicities. In 
addition, we consider two sub-sets of the previous determined frequencies, 
i.e. half-day (si/2 = 1/.0138) and half-year (s 2 /2 = 1/.000038) periodicity. 
Such frequencies reflect the day-night cycle and the summer and winter term. 

[Figure 3 about here.] 
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Thus, we observe persistence, periodicity and heteroscedasticity within onr 
wind speed data similar to the approaches by [20], [2] and [9]. Apparently, 
these models use different data sets. For example [6] implement an ARMA- 
GARCH in mean model to their 15 minute wind data set. [20] use hourly and 
[2] use three hour wind data. We apply the aforementioned approaches to our 
data, but they do not provide an overall satisfying result. For instance, [2] 
take periodic regressors within the variance part into consideration, whereas 
our variance data shows an asymmetric conditional heteroscedasticity. Such 
a result is related to the underlying data set and therefore, we have to develop 
another approach. We provide two novel wind speed models which use a new 
method to capture the periodic behaviour of wind. 

2.1 Model description 

Let {W t } be a high-frequency process of wind speed, measured in m/s. The 
univariate time series of wind speed measurements is given by l'Fi44(r-i)+o = 
W t , where r G {l,...,Af} is the day, o G {1,..., 144} is the 10 minute data 
point within a day. We are able to generalise the model [?, see]]Brockwell2009 

Wt = Intercept + Trend t + Seasonal t + e t , (1) 

where {e f } is the residual process with zero mean and time dependent vari¬ 
ance {(T t }. The residual process incorporates autoregressive coefficients and 
conditional heteroscedasticity within the variance. The general model (1) is 
the most simplified expression of the wind speed time series. Hereafter, we 
explain the seasonal regressors in more detail. 


Trendt + Intercept 
Seasonal 


$ trend t + $ii, 
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where i\ = {1,.,.,/ci} and i 2 = { 1 , - - -, A: 2 } provide up to k\ and k 2 time 
dependent periodic functions. I n l2 G {0,1} represents an element of the 
indicator matrix I with ki x k 2 dimension and k} tre nd is the trend coefficient 
which is constant over time. Moreover, the function set ff G {/■ S ] 1 J2 , Z/, 2 ,;-,}, 
where i G {ii,i 2 }, provides periodic external variables given by 
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where cos p and sin p are p-generalised periodic functions or Fourier, series if 
p = 2. Commonly, we observe the classical way to model the periodicity by 
Fourier series, like [10] and [20] show. Therefore, our first model uses the 
classical sine and cosine decomposition (p = 2) as well. Our second model 
uses the p-generalised sine and cosine functions of [17] (p > 0) to obtain a 
more flexible periodic structure. [18] mention that recently, the circle number 
Ti is generalised for l 2)P circles. Furthermore, [17] use the p-generalised radius 
coordinates to define p-generalised trigonometrical functions for p > 0. The 
p-generalised trigonometrical functions are given by [17] 
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where (p e M. Moreover, we have to describe the residual process e* which 
follows an ARFIMA-APARCH and is given by 


e t = <f>(B)(l-B) d X t = 0(B)Z t , 

Z, = a t rj t where rj t ~ 

cr 5 t = a 0 + E ai{\Z t -i\ - liZ t -i) 5 + E 

1=1 m= 1 


where d G (—0.5, 0.5) is the differencing parameter, B is the backward shift 
operator with B u X t = X t _ u . The polynomials 0 and 6 are (f>(B) = 1 — 
0i B — ... — (f>jB j and 0(B) = 1 + 6\B + ... + 6 q B q . 5 is the power of the 
APARCH process, 7 is the asymmetry parameter and {rjt} are the residuals. 
[5] point out that instead of the conditional variance, the conditional standard 
deviation with power 5 follows an APARCH. Wind speed data is highly 
volatile and different wind regimes may lead to the asymmetric conditional 
heteroscedasticity. Thus, if the wind speed is relatively low, we observe a 
different conditional variance compared to high speed wind regimes. The 
residuals {r] t } are described by a skew Student’s t-distribution F V:IJ (J ^. One 
definition of this distribution is given by [7]. They combine two halves of a 
symmetric t-distribution which are differently scaled. The skew-t distribution 
is given by 


fx(x) 




I( x > /j) + £ > 2 I(x < ft) 
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( 9 ) 


with v degrees of freedom, /(•) the indicator function, expectation /i, variance 
a 2 = v/(v — 2) and r(-) being the Gamma function. Moreover, £ is the 
skewness parameter with £ > 0. It reduces / to the noncentral Student’s 
t-distribution if £ = 1 . 



Finally, the model description conducts into two approaches, although both 
models incorporate interaction terms [13]. This idea is based on our data set, 
where we observe a high periodic structure. Initially, we apply the classical 
additive Fourier series decomposition, but there is still a periodic behaviour 
observable. Therefore, we take the periodic interaction into account. The in¬ 
teraction of diurnal periodic functions with the annual ones covers a flexible 
and time varying explanatory variable within the mean part of (1). Depend¬ 
ing on the dimension of the indicator matrix, we observe different periodic 
functions which interact with each other. According to our wind speed mod¬ 
els, the Bayesian information criterion (BIC) and the significance of different 
interactions result in the following indicator matrix 
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where I is a 5 x 5 matrix. The first element (In) is set to zero, because it is 
already covered by the intercept (2). Obviously, our obtained indicator ma¬ 
trix I excludes a few interactions. These are combinations of half-daily and 
half-yearly, half-yearly and diurnal as well as half-daily and annual periodic 
functions. Thus, we obtain the following seasonal term from Equation (3) 
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2.2 Parameter estimation and in-sample results 

Subsequently, we discuss the parameter estimation of both modelling ap¬ 
proaches. The first model contains classical sine and cosine functions, which 
is described by (2), (3) and ( 8 ). Our first model is estimated by a quasi 
maximum likelihood method, whereas the p-generalisation parameter of our 
trigonometric functions remains constant (i.e., p = 2). Unfortunately, the 
estimation procedure of the second model is more complex. Here, we have 
to solve nonlinear equations, which is related to the p-generalised periodic 
function (i.e. p > 0), to obtain suitable results. Therefore, we primarily 
estimate the intercept, trend and the seasonal coefficients as well as the pa¬ 
rameter p > 0 for the generalised sine and cosine functions by generalised 
least squares estimation. These values are only used as starting values. In the 
next step we estimate the entire second model by quasi maximum likelihood. 
According to the complete parameter space which might be included into 
our model, we have to select an information criterion which punishes over- 
parametrisation. We choose the Bayesian information criterion (BIG) to find 
the best model order as it is more conservative according to the parameter 
selection than, e.g., the Akaike information criterion (AIC). 
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As previously stated, the second wind speed prediction approach uses p- 
generalised sine and cosine functions (p > 0), which are shown in (4) and (5). 
For each periodic function /?, with i G {ii,i 2 }, i\ = { 1 , q*| 5 }, i 2 = { 1 , 
and s G {si,s 2 }, we include a sin p and cos p function in the second model. 
Thus, we have to estimate the parameter set p G {p s ,sin,Ps,cos} with p > 0 for 
each p-generalised trigonometrical function. Bearing in mind that a special 
case of this function is p SiS \ n = p s , cos = 2 and we do not estimate the parameter 
set p for our first model. 

Tables 1 and 2 provide the estimated parameters for each station. We show 
the parameter estimates for the model order which is obtained by the small¬ 
est BIC as well as the smallest in-sample mean square error. To evaluate the 
significance of a coefficient, we use a level of significance of a = 0.01. We 
have to test whether the p-generalisation coefficient p s , S m ^ 2 and p S:C os ^ 2 . 
Indeed, the p-coefficients are p s , S m 7 ^ 2 and p s , cos ^ 2. The estimated coef¬ 
ficients for all stations and both modelling approaches are significant, with 
few exceptions. Apparently, the diurnal and annual seasons are significant 
as well as the biannual and half-daily ones. Surprisingly, the fractional inte¬ 
gration parameter d is rejected at station Lindenberg for both models. Ac¬ 
cording to the long-memory parameter d G (—0.5, 0.5), we obtain different, 
but significant results for the other stations. The optimal model order for 
the ARFIMA(j,d,q) part in both cases is given by j = 2 autoregressive and 
q — 1 moving average parameters. The estimated APARCH coefficients are 
significant for a = 0.01, except for the APARCH intercept ao for Manschnow 
and Grunow. Here, the best model order of the APARCH process is given 
by Q = 2 and P = 1. Moreover, 7 is significant for all stations and models. 
According to the definition shown in ( 8 ), we observe negative estimates, but 
in fact positive shocks. This means that choppy wind increases the variance 
of the wind speed. The estimation results of the skew-t distribution indi¬ 
cate a left skewed distribution £ > 1 with degrees of freedom v > 2. While 
performing a partial t-test whether £ = 1 or alternatively £ ^ 1 , we have 
to reject the null-hypothesis for each station. Analysing v for all examples 
provides computations of 7 < v < 9.5. This result confirms the heavy tail 
assumption of the t-distribution. 
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[Table 1 and 2 about here.] 

Following the discussion of the parameter estimation, we analyse the auto¬ 
correlation function of the standardised residuals and the absolute value of 
the residuals with power 5. The results are illustrated in Figure 4 and 5. 
The histograms of the standardized residuals are shown and compared to 
the underlying skew-t distribution with skewness parameter £ and v degrees 
of freedom. Figure 4 provides satisfying results for the standardized residuals 
{rj t }, but with some limitations. One problem is obviously disclosed by the 
autocorrelation function (ACF) plot. The autocorrelation of the standard¬ 
ised residuals still show minor periodicity. In comparison to the absolute 
value of the lagged S residuals, the seasonality becomes apparent. According 
to Figure 5, the results are very similar to Figure 4. The histogram of the 
standardized residuals {rjt} indicates a variance reduction. Moreover, the 
autocorrelation function supports this hireling. The ACF of {rjt} provides 
slightly better results than the first model. Nevertheless, the ACF of {|} 
shows a remaining seasonality. 

[Figure 4 and 5 about here.] 

Finally, we compare the model diagnostics. The improvements of the second 
model that is indicated by the parameter estimation are emphasized by a 
lower BIC. In addition, the R 2 and the MSE provide better results for the 
second model. The second model discloses better results related to a minor 
presence of autocorrelation. We perform an autocorrelation test on the stan¬ 
dardized and the absolute values of the standardized residuals with power 5 
for different lags. The Ljung and Box Q-Statistics indicate a small correla¬ 
tion for some lags, given a level of significance of a = 0.05. Nevertheless, the 
remaining autocorrelation should be negligible, whereas conducting a test 
on the underlying skew-t distribution sustains the distributional assumption. 
Moreover, the histogram of the standardized residuals and the assumed un¬ 
derlying skew-t distribution point into the same direction. Summing up the 
model diagnostics, there is a multitude of arguments for the selected time 
series model, but with some limitations according to the periodic correlation. 
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3 Forecasting wind power 


The two aforementioned wind speed models are evaluated according to their 
out-of-sample performance. Therefore, we have to examine different accuracy 
measures. One of those criteria is the root mean square error (RMSE) and 
another one which we consider is the mean absolute error (MAE). In contrast 
to the RMSE and MAE, which evaluate the accuracy of the wind speed 
prediction, we analyse an alternative loss function. This loss function is 
becoming more common [23]. 

The main interest is the prediction of wind power [14]. The production of 
wind power strongly depends on factors like: air density p (1.25 kg/m 3 ), the 
rotor swept area A, the wind speed W t and an efficiency or power constant 
C p [?, see]]burton2001wind. The power production formula that models the 
relationship between wind speed and wind power is given by: 

Pow t = lC pP A{W t ) 3 , (11) 

where Pow t is the power production function at time t. Obviously, for each 
turbine type we obtain different power curves and a somehow different loss 
function. Therefore, assuming a typical power curve which holds for each 
region and all wind turbines is not suitable. At the German-Polish bor¬ 
der region for instance, the Fuhrlander MD 77 wind power plant is widely 
used. Considering Brandenburg altogether, we assess several different tur¬ 
bine types. They differ significantly with respect to their low cut-in wind 
speed, efficiency, rated output speed and cut-out speed. [10] introduce a spe¬ 
cific accuracy measure, the power curve error (PCE). Their aim is to point 
out the problem of wind power prediction. We use this power curve error 
with a specific power function shown in Figure 6. 

[Figure 6 about here.] 
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A common problem of the wind power prediction is that precise output is 
not available at each measurement station. Moreover, if wind power data 
is available, we only observe noisy observations. Therefore, we use (11) to 
approximate the real power data. The PCE loss function of [10] is given by 


L(W t+1 ,W t+l ) 


T(Pow(W t+1 ) - Pow(W t+1 )), 

(1 - r)(Pow(W t+1 ) - Pow(W t+l )), 


bbi+i < Wt+i 

W t+ 1 > W t+ 1 , 

( 12 ) 


where r is a weight with r G [0; 1]. In contrast to [10], we consider cases 
where underestimation is penalized stronger than overestimation, r < 0.5 
and the reverse case r > 0.5. The main aim of the introduced models is to 
minimize the respective loss function. According to the PCE, we want to 
find the quantile r minimizing the loss function. 

For the out-of-sample forecasts, we select incidentally different points in 
time k in the out-of-sample period n G {00 : 00 Januar 1,2012,..., 23 : 
50 December 31,2012}. The previously calculated in-sample model order 
is used as initial estimate. A part of the information set available at time 
k, namely W / k _ 2 2 o,ooo+i, •••, W K is used for the out-of-sample forecasts. We 
calculate 1 ''P k .+18|k and repeat this procedure 5,000 times to obtain suitable 
forecasting results up to a maximum of three hours. For each model type 
we use the so called persistence method [15] as a crude benchmark. Such 
a short-term forecast model uses the observation today as predictor for the 
next instance in time. The forecast for such a model is quite simple, but 
provides practicable results for high-frequency data. For the PCE, we take 
different values for r. [10] follow [16], who propose r = 0.73. In this case, 
underestimation is punished more strongly. Moreover, they take different 
values of r to control the overall estimation results and analyse the sensitiv¬ 
ity of their models. According to their findings, we decide to choose three 
different values of r G {0.25,0.5,0.75}. 

Table 4 gives an overview of predictability results of the three models. Pri¬ 
marily, we discuss the RMSE and MAE and subsequently consider the PCE. 
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Here, the best prediction model is obvious. In this case, the second model 
with generalised sine and cosine functions and ARFIMA-APARCH process 
is the best one for a three hour ahead prediction. We have to remark that 
at station Lindenberg, the fractional integrated part is insignificant. Even 
though the persistent model is hard to beat for predicting very short periods 
of time, the model fails to forecast up to three hours. 

[Table 4 about here.] 

Apparently, Table 4 provides an ambiguous overall result. Analysing the 
PCE changes our previous findings. As a result from the PCE comparison, 
we select the first model with ARFIMA-APARCH process and Fourier series 
in the mean part, if r = 0.25. The Erst model has still some minor model 
limitations related to the remaining information within the variance of the 
standardised residuals. The forecasting accuracy of model two is inferior to 
the first model. Still, it is better than the persistent model. This finding 
indicates that we have to choose the first model if overestimation of wind 
power is punished. This might be the case if the profit of a wind farm is 
calculated. 

In general, it is important for an investment into a wind farm to know how 
much wind on average is expected and how much wind power has to be 
produced. The investor is unwilling to receive losses according to an overes¬ 
timation of the expected wind power. The PCE for r = 0.5 indicates that 
the second model provides the best forecasting results. Here, the over- and 
underestimation is punished similarly. Therefore, it is not surprising that 
after our transformation into wind power the second model remains the best 
one. The third row shows the PCE for r = 0.75. Though the underestima¬ 
tion is penalized, the overall PCE that is shown in the last column provides 
only marginal differences between each model. According to our previous 
findings, the best prediction model changes from station to station and also 
from month to month. The initial best prediction model (Model 2) tends to 
overestimate the predicted wind power of the Fuhrlander MD 77. Punishing 
underestimation of wind power is related to the loss function of a system 
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operator. In case of underestimation, the system operator has to order too 
much power to match the demand. Subsequently, the operator has more 
power than needed, the generation is reduced and he faces costs of down- 
regulation. [16] point out that the costs of down-regulation are higher than 
those for up-regulation and lead to higher expenditures. 

Indeed, the results of the PCE are related to the underlying production func¬ 
tion shown in Figure 6. The Fuhrlander MD 77 is a small turbine which is 
used onshore. We are able to include various turbines to control the sensi¬ 
tivity of the obtained results. While considering different types of turbines, 
the PCE changes as well. Nevertheless, the turbines onshore situated at the 
German-Polish border are somehow comparable. Most of them have a sim¬ 
ilar cut-in wind speed, but a few of them have a higher rated and cut-out 
wind speed. While changing the wind power production function, the PCE 
changes in some cases. We use the power production function of a GE 1.6 
MW turbine to verify our previous findings. Although some results remain 
similar, we obtain different findings when r = 0.75. Now, the second model 
is marginally better than the other prediction models for forecasting wind 
power. 

The second model provides the best prediction results for the German-Polish 
border region. Moreover, it is related to the weight r which model is the best 
method to make predictions of the wind power. In particular, the persistent 
model is a good benchmark for wind power prediction, but for our wind 
speed prediction, it is outperformed. An onshore wind park operator who is 
interested in a lower bound of wind power prediction has to use the second 
model. 


4 Conclusion 

Recent research has covered a wide range of wind speed forecasting ap¬ 
proaches. In this paper, two novel statistical approaches for high-frequency 
wind speed data are described. Several extensions are implemented to cover 
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the specific structure of the data set. The mean part takes annual and di¬ 
urnal interactions into account. Moreover, we distinguish between a first 
model with classical Fourier series and a second model with p-generalised 
sine and cosine functions. The autocorrelation and the heteroscedasticity of 
both approaches is modeled by an ARFIMA-APARCH process with skew-t 
distributed residuals. The models are suitable to make three hour forecasts 
for wind speed and can be used for wind power prediction as well. 

According to RMSE and MAE, the developed approaches outperform com¬ 
mon models. The generalised trigonometrical functions provide the smallest 
accuracy measures according to the three hour ahead forecasts. Furthermore, 
we comprise the PCE which provides ambiguous results. The overall PCE 
provides the best results for the Erst model, if we consider a small weight. 
Nevertheless, the second model derives the best results if the weight is 0.5 
or 0.75. Finally, we remark that the PCE is also depending on the previ¬ 
ously specified turbine type. In this article, the models are applied to four 
stations situated at the German-Polish border. For this reason, the power 
production function of a Fuhrlander MD 77 is used to calculate the PCE. The 
Fuhrlander MD 77 is a small turbine which is often used onshore, especially 
in plain regions in Germany. The model robustness which is briefly discussed 
in the end, shows that some changes within the power curve function lead to 
different best prediction models. 

In practice, only the wind speed data set is needed. Therefore, such a statis¬ 
tical wind speed prediction model is cost saving compared to other modelling 
approaches that require a set of possibly many more variables. Clearly, there 
are further improvements of this approach possible. An idea is to analyse 
the variance in more detail and to consider meteorological regressors. Finally, 
the wind speed model might include spatial informations. 
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Figure 1: Wind speed sample of Lindenberg reported in 2011 and corre¬ 
sponding histogram. 





Figure 2: Wind speed autocorrelation function plot of Lindenberg. 
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Figure 3: Estimation of the spectral density at each single station. 


Table 1: Parameter estimation of the first model for all stations. 
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Table 2: Parameter estimation of the second model for all stations. 



Manschnow 

Lindenberg 

Angermiinde 

Griinow 

Regression coefficients 

Estimate 

Std. Error 

Estimate 

Std. Error 

Estimate 

Std. Error 

Estimate 

Std. Error 

trend 

-0.0000025*** 

0.00000006 

-0.0000040*** 

0.00000006 

-0.0000027*** 

0.00000007 

-0.0000028*** 

0.00000007 

^11 

3.2938*** 

0.0075 

4.0356*** 

0.0079 

4.1064*** 

0.0089 

4.6355*** 

0.0093 

$12 

-0.5387*** 

0.0051 

-0.1407*** 

0.0054 

-0.5227*** 

0.0060 

-0.5269*** 

0.0063 

$13 

-0.3987*** 

0.0056 

-0.1183*** 

0.0060 

-0.4452*** 

0.0066 

-0.4839*** 

0.0070 

^14 

0.0666*** 

0.0047 

0.0274*** 

0.0049 

0.0658*** 

0.0055 

0.0890*** 

0.0058 

$15 

0.2100*** 

0.0053 

0.1090*** 

0.0218 

0.2147*** 

0.0062 

0.1941*** 

0.0065 

$21 

0.0410*** 

0.0047 

-0.2354 *** 

0.0049 

-0.3147*** 

0.0055 

-0.1672*** 

0.0058 

$22 

0.1612*** 

0.0207 

0.0501 ** 

0.0218 

-0.2078*** 

0.0244 

-0.4095*** 

0.0255 

$23 

7.6052*** 

0.4566 

-1.6406 *** 

0.4824 

2.7394*** 

0.5387 

-1.4781*** 

0.5633 

$31 

0.5649*** 

0.0047 

0.3787 *** 

0.0050 

0.4958*** 

0.0056 

0.5853*** 

0.0058 

$32 

0.5948*** 

0.0107 

0.0797 *** 

0.0113 

0.5288*** 

0.0126 

0.4836*** 

0.0132 

^33 

0.4507*** 

0.0137 

0.1920 *** 

0.0144 

0.5904*** 

0.0161 

0.5665*** 

0.0168 

$41 

-0.1658*** 

0.0107 

-0.0067 

0.0113 

0.1490*** 

0.0126 

0.0890*** 

0.0058 

$51 

-0.1030*** 

0.0050 

-0.1517 *** 

0.0053 

-0.1093*** 

0.0059 

0.1941*** 

0.0065 

Pl2 

2.4709*** 

0.0104 

4.5886 *** 

0.0029 

1.9080*** 

0.0102 

2.3876*** 

0.0024 

P 13 

1.0264*** 

0.0058 

1.4824 *** 

0.0039 

1.1167*** 

0.0057 

1.0529*** 

0.0036 

P14 

1.3801*** 

0.0432 

51.3130 *** 

0.1569 

1.2694*** 

0.0091 

1.6574*** 

0.0026 

P15 

1.7776*** 

0.0189 

1.5631 *** 

0.0019 

1.9647*** 

0.0104 

4.2203*** 

0.0035 

P21 

0.3945*** 

0.0030 

0.4286 *** 

0.0009 

0.1752*** 

0.0002 

0.1566*** 

0.0002 

P31 

8.5535*** 

0.1220 

31.0516 *** 

0.1011 

30.9854*** 

0.0146 

19.7518*** 

0.0047 

P41 

0.5056*** 

0.0060 

76.5876 *** 

0.2387 

0.1535*** 

0.0004 

0.1404*** 

0.0001 

P51 

1.6385*** 

0.0485 

15.3874 *** 

0.0409 

6.9258*** 

0.0223 

0.1778*** 

0.0002 

ARFIMA parameters 

d 0.4475*** 

0.0024 

0.000001 

0.000008 

0.1006*** 

0.0126 

0.2564*** 

0.0087 

4> i 

1.2184*** 

0.0094 

1.4044 *** 

0.0011 

1.4961*** 

0.0183 

1.4318*** 

0.0016 

4> 2 

-0.2583*** 

0.0071 

-0.4116 *** 

0.0011 

-0.5048*** 

0.0172 

-0.4438*** 

0.0015 


-0.8081*** 

0.0215 

-0.6195 *** 

0.0047 

-0.7683*** 

0.0133 

-0.7919*** 

0.0014 

APARCH parameters 

a 0 0.0012* 

0.0006 

1.75 

0.0091*** 

0.0189*** 

0.0017 

0.0020* 

0.0011 

ai 

0.1392*** 

0.0051 

26.79 

0.1343*** 

0.1260*** 

0.0071 

0.1583*** 

0.0064 

01 

0.5188*** 

0.0463 

11.18 

0.7175*** 

0.5495*** 

0.0529 

0.5516*** 

0.0494 

02 

0.3738*** 

0.0437 

8.54 

0.1606*** 

0.3037*** 

0.0486 

0.3259*** 

0.0458 

71 

-0.1372*** 

0.0131 

-10.46 

-0.2863*** 

-0.1951*** 

0.0254 

-0.1086*** 

0.0188 

<5 

0.9594*** 

0.1076 

8.91 

0.9430*** 

1.3879*** 

0.0766 

1.0809*** 

0.0463 

€ 

1.0693*** 

0.0083 

128.16 

1.0517*** 

1.0549*** 

0.0085 

1.0158*** 

0.0079 

V 

7.8890*** 

0.2934 

26.88 

9.1984*** 

8.8649*** 

0.4010 

7.3345*** 

0.2829 

BIC 

*** 

1.2695 1.1185 

Significance at 1%, ** significance at 5% and * 

1.5616 

significance at 10%. 


1.3596 


23 



Density Density Density Density 


Station Manschnow 


Histogram 



Standardised estimated residuals 

Kernel: Skewed student's t-distribution 


Sample ACF 



Lag 

Station Lindenberg 


Sample ACF 



Lag 


Histogram 


Sample ACF 


Sample ACF 



<N 

O 


O 



Standardised estimated residuals 

Kernel: Skewed student's t-distribution 






Figure 4: Model 1: Histogram and ACF plot of { r /} and {|? 7 | 5 }. 


The figures show the histogram of the residuals { 77 ^} in the first column. The 
ACF of the residuals are provided in the second column, whereas the third 
column shows the ACF of {|^ 7 t| 5 }- 
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Figure 5: Histogram, ACF plot of q and \q\ s - second model 


The figures show the histogram of the residuals {f)t} in the first column. The 
ACF of the residuals are provided in the second column, whereas the third 
column shows the ACF of {|?7t| <5 }- 
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Figure 6: Power production function of a Fuhrlander MD 77 

The first zone reaches from 0 m/s to the cut-in wind speed of 3m/s. No 
power is produced within this area. The sigmoidal relationship of the power 
function (11) is used for the second zone. The rated output wind speed of 
13m/s and the rated output power of about 1500/cPF is the beginning of the 
third zone. From 13 m/s to 20m/s, the wind power plant is able to produce 
the maximum wind power. The rotor stops to produce power within the 
forth zone, beginning from the cut-out speed of 20m/s. 
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Table 4: Forecasting accuracy measures for station Manschnow and Lindenberg. The table provides the estimated 
forecasting measures for each model and station. The lowest of each measures is bold. 
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Model 2 PCE r = 0.75 25.39 16.74 34.88 13.95 8.43 21.76 13.52 15.19 11.43 13.51 21.93 15.17 17.66 

Persistent PCE r = 0.75 21.74 15.17 38.87 11.21 8.46 21.17 15.64 18.98 14.82 17.21 18.54 12.91 17.89 




Table 5: Forecasting accuracy measures for station Angermunde and Griinow. The Table provides the estimated 
forecasting measures for each model and station. The lowest of each measures is bold. 
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Model 2 PCE r = 0.75 35.18 42.24 31.12 21.73 21.68 21.54 21.97 25.79 33.63 32.61 31.49 21.14 28.34 

Persistent PCE r = 0.75 31.94 39.29 38.74 24.41 17.69 25.71 23.17 31.32 24.25 33.36 35.76 22.76 29.03 




